CLAIMS 

What is claimed is: 



1 LA method, comprising: 

2 determining an identity of a speaker through a network over which output data, 

3 regarding a person with access to a speech-recognition system receiving the output data, 

4 is provided to one or more speech-recognition systems; 

5 attempting to locate, based on the identity of the speaker, a voice model for the 

6 speaker; and 

7 retrieving from a storage area the voice model for the speaker if the voice model 

8 for the speaker is located. 



1 2. The method of claim 1 , wherein the voice model comprises a speaker- 

2 dependent voice model. 
1 

2 3 . The method of claim 2, wherein determining the identity of the speaker 

3 over the network comprises using information received from the speaker over the 

4 network to determine the identity the speaker. 

1 4. The method of claim 2, wherein determining the identity of the speaker 

2 over the network comprises: 

3 receiving from a device in the network identifying data regarding the speaker; and 

4 determining the identity of the speaker based on the identifying data regarding the 

5 speaker. 



042390.P13063 



- 15- 



Express Mail No. EL414998485US 



5 . The method of claim 2, wherein the storage area comprises an internal 
storage area containing speaker-dependent voice models for multiple persons. 

6. The method of claim 2, wherein the storage area comprises an external 
storage area accessible over the network. 

7. The method of claim 2, wherein the output data comprise phonemes. 

8. The method of claim 7, further comprising: 
receiving an utterance from the speaker; 

using the voice model to extract phonemes from the utterance; and 
transmitting the phonemes over the network to the speech-recognition system. 

9. The method of claim 8, wherein the utterance comprises one or both of 
vocalized words and vocalized sounds. 

10. The method of claim 9, further comprising: 

receiving from the speech-recognition system contents of a recognized utterance 
of the speaker; and 

revising the voice model for the speaker based on the contents of the recognized 
utterance. 
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11. The method of claim 2, wherein the output data comprise a voice model 
for the speaker. 

12. The method of claim 1 1 , further comprising transmitting the voice model 
over the network to the speech-recognition system. 

13. The method of claim 2, further comprising 

receiving Aurora features extracted from an utterance of the speaker; 
extracting phonemes from the Aurora features; and 

transmitting the phonemes over the network to a speech recognition system. 

1 4. The method of claim 2, further comprising: 

retrieving a speaker-independent voice model if failing to locate the voice model 

for the speaker; 

receiving an utterance from the speaker; 

using the speaker-independent voice model to extract phonemes from the 
utterance; 

transmitting the phonemes over the network to a speech-recognition system; 
receiving from the speech-recognition system contents of a recognized utterance 
of the speaker; and 

generating a voice model for the speaker based on the contents of the recognized 
utterance. 
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1 1 5 . A method, comprising: 

2 accessing by a speaker a network containing a speech recognition system; 

3 identifying by a first device the speaker based on information provided by the 

4 speaker; 

5 requesting by the first device a speaker-dependent voice model for the speaker 

6 from a voice model database server providing phonemes to any speech recognition 

7 system in the network; 

8 retrieving by the voice model database server the speaker-dependent voice model 
Z 9 from a storage area if the voice model database server locates a speaker-dependent voice 
y 10 model for the speaker; 

J 1 1 connecting by the first device the speaking device with the voice model database 

€1 12 server; 

P 13 prompting by the voice model database server the speaker to provide an utterance; 

^ 14 speaking by the speaker the utterance into the speaking device; 

jf?{ 15 receiving by the voice model database server the utterance; 

16 using by the voice model database server the speaker-dependent voice model to 

17 extract phonemes from the utterance; 

1 8 transmitting by the voice model database server the phonemes over the network to 

19 a speech-recognition system; and 

20 using by the speech-recognition system the phonemes to determine a content of 

21 the utterance. 
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16. The method of claim 15, wherein the storage area comprises a storage area 
within the voice model database server containing speaker-dependent voice models for 
multiple persons. 

1 7. The method of claim 15, wherein the storage area comprises a storage area 
accessible by the voice model database server over the network. 

18. An article of manufacture comprising: 

a machine- accessible medium including thereon sequences of instructions that, 
when executed, cause one or more machines to: 

determine an identity of a speaker through a network over which output data, 
regarding a person with access to a speech-recognition system receiving the output data, 
is provided to one or more speech-recognition systems; 

attempt to locate, based on the identity of the speaker, a voice model for the 
speaker; and 

retrieve from a storage area the voice model for the speaker if the voice model for 
the speaker is located. 

19. The article of manufacture of claim 18, wherein the sequences of 
instructions that, when executed, cause the one or more machines to attempt to locate, 
based on the identity of the speaker, the voice model for the speaker, comprise sequences 
of instructions that, when executed, cause the one or more machines to attempt to locate, 
based on the identity of the speaker, a speaker-dependent voice model for the speaker. 
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1 20. The article of manufacture of claim 19, wherein the sequences of 

2 instructions that, when executed, cause the one or more machines to retrieve from the 

3 storage area the voice model for the speaker if the voice model for the speaker is located 

4 comprise sequences of instructions that, when executed, cause the one or more machines 

5 to retrieve from an internal storage area containing speaker-dependent voice models for 

6 multiple persons the voice model for the speaker if the voice model for the speaker is 

7 located. 



l= 1 21 . The article of manufacture of claim 19, wherein the sequences of 



O 2 instructions that, when executed, cause the one or more machines to retrieve from the 

ILLS 

00 3 storage area the voice model for the speaker if the voice model for the speaker is located 

%a 4 comprise sequences of instructions that, when executed, cause the one or more machines 

1* 5 to retrieve from an external storage area accessible over the network the voice model for 

;H 6 the speaker. 

w 
o 

fU 

1 22. The article of manufacture of claim 19, wherein the sequences of 

2 instructions that, when executed, cause the one or more machines to determine the 

3 identity of the speaker through the network over which the output data, regarding the 

4 person with access to the speech-recognition system receiving the output data, is 

5 provided to the one or more speech-recognition systems comprise sequences of 

6 instructions that, when executed, cause the one or more machines to determine the 

7 identity of the speaker through the network over which phonemes to the one or more 



042390.P 13063 



-20- 



Express Mail No. EL414998485US 



8 speech-recognition systems is provided regarding the person with access to the speech- 

9 recognition system receiving phonemes. 

1 23. The article of manufacture of claim 22, wherein the machine-accessible 

2 medium further comprises sequences of instructions that, when executed, cause the one 

3 or more machines to: 

4 receive an utterance from the speaker; 

5 use the voice model to extract phonemes from the utterance; and 

6 transmit the phonemes over the network to a speech-recognition system. 

:L~Jf 

p 
'i .. ?, 

% l 24. The article of manufacture of claim 23, wherein the machine- accessible 

X 

yj 2 medium further comprises sequences of instructions that, when executed, cause the one 

13 3 or more machines to : 

U 4 receive from a speech-recognition system contents of a recognized utterance of 

W 

Jr{ 5 the speaker; and 

6 revise the voice model for the speaker based on the contents of the recognized 

7 utterance. 

1 25 . The article of manufacture of claim 19, wherein the sequences of 

2 instructions that, when executed, cause the one or more machines to determine the 

3 identity of the speaker through the network over which the output data, regarding the 

4 person with access to the speech-recognition system receiving the output data, is 

5 provided to the one or more speech-recognition systems comprise sequences of 
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6 instructions that, when executed, cause the one or more machines to determine the 

7 identity of the speaker through the network over which the voice model regarding the 

8 person to the one or more speech-recognition systems is provided regarding the person 

9 with access to the speech-recognition system receiving the voice model regarding the 

10 person. 

1 26. The method of claim 19, wherein the machine-accessible medium further 

2 comprises sequences of instructions that, when executed, cause the one or more machines 
13 3 to transmit the voice model over the network to a speech-recognition system. 

5, if 

«P 1 27. The article of manufacture of claim 26, wherein the machine-accessible 

O 

*y 2 medium further comprises sequences of instructions that, when executed, cause the one 

r? 3 or more machines to : 

J7§ 4 retrieve a speaker-independent voice model if failing to locate the voice model for 

Hen* 

ffj 5 the speaker; 

6 receive an utterance from the speaker; 

7 use the speaker-independent voice model to extract phonemes from the utterance; 

8 transmit the phonemes over the network to a speech-recognition system; 

1 receive from the speech-recognition system contents of a recognized utterance of 

2 the speaker; and 

3 generate a voice model for the speaker based on the contents of the recognized 

4 utterance. 
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28. An apparatus, comprising: 

an identification determiner to determine an identification of a speaker through a 
network over which output data, regarding a person with access to a speech-recognition 
system receiving the output data, is provided to one or more speech-recognition systems; 

a voice-model locator to locate a speaker-dependent voice model for the speaker 
based on the identity of the speaker; and 

a voice-model retriever to retrieve the speaker-dependent voice model for the 
speaker from a storage area based on the identity of the speaker. 

29. The apparatus of claim 28, further comprising: 

an utterance receiver to receive an utterance from the speaker; 

a phoneme extractor to extract phonemes from the utterance using the speaker- 
dependent voice model; and 

a phoneme transmitter to transmit the phonemes over the network to a speech- 
recognition system. 

30. The apparatus of claim 26, further comprising: 

a recognized-utterance receiver to receive from a speech-recognition system 
contents of a recognized utterance of the speaker; and 

a voice model reviser to revise the speaker-dependent voice model of the speaker 
based on the contents of the recognized utterance. 
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